Agnostic System Identification for Monte Carlo Planning

نویسنده

  • Erik Talvitie
چکیده

While model-based reinforcement learning is often studied under the assumption that a fully accurate model is contained within the model class, this is rarely true in practice. When the model class may be fundamentally limited, it can be difficult to obtain theoretical guarantees. Under some conditions the DAgger algorithm promises a policy nearly as good as the plan obtained from the most accurate model in the class, but only if the planning algorithm is near-optimal, which is also rarely the case in complex problems. This paper explores the interaction between DAgger and Monte Carlo planning, specifically showing that DAgger may perform poorly when coupled with a sub-optimal planner. A novel variation of DAgger specifically for use with Monte Carlo planning is derived and is shown to behave far better in some cases where DAgger fails.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Radial dose functions of GZP6 intracavitary brachytherapy 60Co sources: treatment planning system versus Monte Carlo calculations

Background: The Monte Carlo (MC) method is not only used for dose calculations around brachytherapy sources but also for benchmarking treatment planning systems (TPS) calculations. Materials and Methods: Three 60Co sources of GZP6 brachytherapy unit were simulated using MCNP4C MC Code. The radial dose functions were calculated by MC method and GZP6 TPS and were compared. Results: There was a go...

متن کامل

Evaluation of Lung Dose in Esophageal Cancer Radiotherapy Using Monte Carlo Simulation

Background and purpose: Radiation therapy make an important contribution in the control and treatment of cancers. Lungs are the main organs at risk in esophageal cancer radiotherapy. Difference between the dose distribution due to the treatment planning system (TPS) and the patient's body dose is dependent on the calculation of the treatment planning system algorithm, which is more pronounced i...

متن کامل

Evaluation of the RtDosePlan Treatment Planning System using Radiochromic Film and Monte Carlo Simulation

Introduction: GafChromic EBT films are one of the self-developing and modern films commercially available for dosimetric verification of treatment planning systems (TPSs). Their high spatial resolution, low energy dependence and near-tissue equivalence make them suitable for verification of dose distributions in radiation therapy. This study was designed to evaluate the dosimetric parameters of...

متن کامل

Dose Calculations for Lung Inhomogeneity in High-Energy Photon Beams and Small Beamlets: A Comparison between XiO and TiGRT Treatment Planning Systems and MCNPX Monte Carlo Code

Introduction Radiotherapy with small fields is used widely in newly developed techniques. Additionally, dose calculation accuracy of treatment planning systems in small fields plays a crucial role in treatment outcome. In the present study, dose calculation accuracy of two commercial treatment planning systems was evaluated against Monte Carlo method. Materials and Methods Siemens Once or linea...

متن کامل

Dosimetric characteristics of 137Cs sources used in after loading Selectron system by Monte Carlo method

Background: For an effective treatment planning in brachytherapy, it is necessary to know the accurate source dosimetric information such as air kerma strength, exposure rate constant, dose rate constant and redial dose distribution. The usual method to determine these factors is thermo luminescent dosimeter (TLD) dosimetry. Nowadays, another more accurate method is known to be the Monte Carlo ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015